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ABSTRACT 

Continuum fitting uncertainties are a major source of error in estimates of the temperature-density 
relation (usually parametrized as a power-law, T oc A 7-1 ) of the inter-galactic medium (IGM) through 
the flux probability distribution function (PDF) of the Lyman-a forest. Using a simple order-of- 
magnitude calculation, we show that few percent- level systematic errors in the placement of the quasar 
continuum due to e.g. a uniform low-absorption Gunn-Peterson component, could lead to errors in 
7 of order unity. This is quantified further using a simple semi- analytic model of the Lya forest 
flux PDF. We find that under-(over-)estimates in the continuum level can lead to a lower (higher) 
measured value of 7. By fitting models to mock data realizations generated with current observational 
errors, we find that continuum errors can cause a systematic bias in the estimated temperature-density 
relation of (£(7)) ~ —0.1, while the error is increased to 07 re 0.2 compared to 07 re 0.1 in the absence 
of continuum errors. 

Subject headings: intergalactic medium — quasars: absorption lines — methods: data analysis 



1. INTRODUCTION 



Over the past two decades, 
forest in the line-of-sight to 
emerged as one of the most 
the high-r e dshift (z > 2) 



the Lyman-a (Lya) 
distant quasars has 
important probes of 
universe (see, e.g. 



Croft et all fl998t IMcDonald et al.l l2000t ICroft et alJ 



ai 

3; 



20021 : iZaldarriaga et al.l 120031: IMcDonald et al.1 120051) . 
This has been underpinned by theoretical advances 
that established the Lya forest as arising from neu- 
tral hydrogen embedded in a warm photo-ionized inter- 
galactic medium (IGM), tracing the density fluctua- 
tions due to gravitational instability in h ierarchica l 
cluster ing cosmological models (s e e, e.g.. iCen et al 
19941: iMiralda-Escude et al.1 119961 : iDave et al.1 1199 
Theuns et al.lll998D . 

In recent years, the Lya forest is increasingly being 
used to gain a more detailed understanding of the inter- 
galactic medium. In particular, the thermal history of 
the IGM holds the key to understanding hydrogen and 
He II reionization at z > 6 and z ~ 3, respectively. For 
an underlying density distribution, A.(x) — p(x)/p, the 
astrophysics of the IGM mediates the optical depth of 
the Lya forest. 

In the standar d photoionization equilib rium model of 
the Lya forest (jGunn fc Peterson 1965), the thermal 
properties of the IGM are usually described in terms of 3 
main parameters: the ionization rate of the photoioniz- 
ing background radiation, F, the temperature of the gas 
at mean density, To, and the temperature-density rela- 
tionshipQ approximated as a power-law, T = To A 7-1 . 

Various authors have placed constraints on the back- 
ground ionization rate, T, of the IGM us ing the effec- 
tive optical depth, T e g, of the Lya forest (|Bolton et al.1 
l2005tlFaucher-Giguere et al.ll2008al ) and the quasar prox- 
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1 The temperature-density relationship of the IGM is often called 
the 'equation of state'. This is technically incorrect as the equation 
of state of the IGM is that of a perfect gas, thus in this Letter we 
do not use this term. 



imity effect (jScott et al.l 120001: iDall'Aglio et al.l [2009). 
Constraints on To have been made using detailed line- 
profile fitting of individual Lya ab sorption lines from 
high-resolution sp ectra (see, e.g., iSchave et al.l 120001 : 
iBecker et aTll2011l ). 

Meanwhile, the probability distribution func- 
tion (PDF) of the tra nsmitted Lya forest flux 
([Jenkins k, Ostrikerl 119911 ) has been used to place 
constraints on the temperature -density rela t ion o f 
the IGM. Using t his t echnique. IBecker et afl (|2007f ). 
iBolton et all ((2008), and|Viel_et_alJ ((2009D have recently 
found evidence of a highly inverted temperature-density 
relation, 7 ~ 0.5, in the Lya forest at z ~ 2 — 3. While 
some theories (e.g., iFurlanetto fc Ohl 120081 ) predict a 
mildly-inverted temperature-density relation (7 re 0.8) 
as a consequence of inside-out He II reionization, the 
amount of energy that needs to be injected into the IGM 
to obtain 7 ~ 0.5 is inconsistent with the observational 
constraints on heati ng sources that those redshifts 
(|McQuinn et al.l 120091 ) . 

Although various systematics such as continuum fit- 
ting errors, noise and metal line contamination can bias 
the interpretation of the flux PDF, high-resolution (R = 
A/AA ~ 10 4 ) and high signal-to- noise (S/N > 50 per 
pixel) Ly a forest spectra can ameliorate these effects 
(see, e.g.. iKim et al.ll2007j) . With such data, metal lines 
in the Lya forest region can be directly identified and re- 
moved, while the low pixel noise allows precise determi- 
nations of the continuum from the peaks of the observed 
Lya transmission, with random errors as low as 1 — 2%. 

However, depending on the underlying properties of 
the IGM, a uniform Gunn-Peterson absorption compo- 
nent could exist in the Lya forest even at relatively 



While this term may be reminiscent of the obsolete pic- 
ture of the IGM as a two-phase medium consisting of dense and 
cool Lya clouds em bedded in a hot, tenuous inter-cloud medium 
llSargen t et al. 1980), in this paper we merely use this term to refer 
generically to a uniform or large-scale low-absorption component 
in the Lya forest transmission field. 
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low redshifts (z ~ 3). In such continuum fit- 

ted to the transmission peaks of the observed Lya for- 
est flux could underestimate the continuum, since few 
of the peaks would reach the true quasar continuum. 
This systematic continuum bias could exist even in high- 
S/N spectra which have small random errors in the pixel 
fluxes. Work e rs in this field are aware of this possibility: 
iBolton et afl (|2008[ ) tested this by fitting their flux PDFs 
with the continuum raised by 1.5% and 5 %, but found 
that t hese provided worse fits to their data. IBecker et al.l 
(|2007f) and IViel et al.l (|2009t) made the continuum level 
a free parameter in their likelihood analysis, although 
neither found a significant continuum bias. 

However, none of the aforementioned studies tried to 
directly estimate the amount of continuum bias at low- 
rcdshift (z < 3 — 4), nor is it well- understood how such a 
bias could affect measurements of the IGM temperature- 
density relation. 

In a paper me asuring the effective optical de pth T e s 
of the Lya forest, IFaucher-Giguere et al.l (|2008bf ) did at- 
tempt to constrain the amount of bias in their contin- 
uum fits by hand-fitting mock spectra derived from nu- 
merical simulations. They found that even at z = 3, 
continua fitted to the peaks of the transmission under- 
estimate the level of the continuum by 5%, although 
they assumed an extreme value of 7 = 1.6 in their 
mock spectra. While this was the temperature-density 
relation calculat ed for a relaxed IGM following hydro- 
gen reionization ()Hui fc Gnedin|[T997[) , it is not expected 
to be v alid during the epoch o f He II reionization a t 
z < 3 (jFurlanetto fc Ohl [20081 iMcQuinn et al.l [2009h . 
Since t c s = — ln((F}), a systematic bias in the estimated 
continuum level leads to errors in r c g at approximately 
the same level (for example, a 2% underestimate in the 
continuum would lead to roughly a 2% overestimate in 
T c ff). In contrast, such continuum errors would bias the 
estimated temperature-density relation in a more compli- 
cated manner, which is not well-understood. This short 
paper is intended to investigate this bias in a simple, 
easily-reproducible, manner. 

In Section [2] we first make a simple back-of-the- 
envelope calculation which indicates that systematic bi- 
ases in the continuum fitting of even a few percent could 
cause large errors in the estimated temperature-density 
relation. Section[3]then introduces a simple semi-analytic 
model of the Lya forest flux PDF to quantify this error, 
followed by a discussion (Section SJ of these biases and 
some methods to correct for them. 

2. SIMPLE ESTIMATES 

In this section we make an order-of-magnitude calcula- 
tion of the bias in the IGM temperature-density relation 
that could potentially arise from systematic errors in the 
placement of the quasar continuum level. We first define 
the fractional continuum error, 



fc 



C e: 



Cu 



(1) 



where Ct rU c is the true quasar continuum in the Lya 
forest, while C es t is the estimated continuum. 

The flux transmission through the IGM F is related to 
the intervening optical depth t by F — exp(— r), while r 
is in turn related to the underlying matter distribution, 
A(z) = p{x)/p. 



For this calculation we use the fluctua ting Gunn- 
Peterson approximation (FGPA, see, e.g., iCroft et ah! 

EUD, 

r = T A 2 - a , (2) 

where A = A(x), a = 0.7(7— l)j an d r o is a factor which 
includes astrophysics such as the Lya recombination co- 
efficient and the photoionizing UV background T. Here 
we assume ro to be constant, so that the only unknown 
parameter of the IGM is the exponent a in the equation 
of state. 
We rearrange Equation [2] to 



In r - In t 

a = — h 2. 

In A 

The derivative of a with respect to r is then 



da 



1 



■In A' 



(3) 



(4) 



This allows us to estimate the error in the derived equa- 



tion of state, Sa = a c 



- a 



(the superscripts 'est' and 



'true' refer to the estimated and true underlying quanti- 
ties respectively), arising from a systematic error in the 
measured optical depth St for a given matter overdensity 
A: 



Sa ' 



1 St 
In A r 



St 

T 



(5) 



In the second line, we have approximated — In A to be of 
order unity, since the low-absorption regions of the IGM 
are in the underdense (A < 1) parts of the universe. 

Now consider the effects of a misunderestimated quasar 
contin uum level C es t, where C t rue is t he true continuum 
level. IFaucher-Giguere et al.l (|2008bD have shown that 
the true optical depth r truc is related to the measured 
optical depth corresponding to the underestimated con- 
by 

c _cst true 
0T = T — T 



=\n\l 

— fct 



C es t — Ct ru 
Ctrue 



(6) 



where in the last line we have substituted in Equation Q] 
and assumed that it is small (|/ c | < 0.1). 

Inserting Equation [5] into Equation [51 and using the 
fact that in the low-absorption regions F — exp(— r) ~ 
1 — t, we get 

Sa ~ (7) 
1 — F w 

As the strongest constraints on the temperature- 
density relation of the IGM come from the low- 
absorption/high-transmission (1 — F ~ %) pixels of the 
Lya forest, systematic errors in the continuum estimate 
of just a few percent can cause large (£7 ~ Sa ~ 1) errors 
in the estimated value of 7. 

3. MOCK ANALYSIS 

In this section, we generate theoretical flux PDFs of the 
Lya forest using a simple semi-analytic model, and quan- 
titatively study the errors introduced into the estimated 
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Fig. 1. — Unbinned flux PDFs generated from Eq.|9] with 7 = 0.5 
(thin red dashed line) and 7 = 1.6 (thin red solid line). The flux 
PDFs are normalized to (F) = 0.70. The thick black lines denote 
the PDFs after being smoothed by a Gaussian kernel corresponding 
to S/N = 50 in our simplified noise model. 




Fig. 2. — (Upper panel) Binned flux PDFs generated from Eq. [9] 
with different temperature-density relations, 7 = 0.5 (green dashed 
line), 7 = 1.0 (blue dotted line), 7 = 1.2 (black solid line), and 
7 = 1.6 (red dot-dashed line). The lower panel shows the ratio 
of the various flux PDFs with respect to the 7 = 1. model. The 
relative differences are similar to those presented in lBolton et al.l 
12003) . 

temperature-density relation caused by systematic errors 
in the Lya forest continuum. 

3.1. Toy Flux PDFs 

We first generate theoretical flux PDFs by using as 
a starting point the semi-a nalytic density PDF from 
iMiralda-Escude et al.l (|2000f) (hereafter MHR00): 



p(A)dA = A exp 



-(A 



-2/3 



C 



,21 



2(2<5 /3) 2 



(8) 



where A is the matter density, while A, /3, Co, and So 
are redshift-dependent parameters interpolated from the 
values published in MHROO. At z = 3, the corresponding 
values are A = 0.558, j3 = 2.35, C = 0.599, and 5 = 
1.90. 



The corresponding Lya forest flux PDF is then derived 
by substituting in A(r) using the FGPA (Equation [2]) 
and then the relation F = exp (— t): 



p(F)dF-- 



dF 



A 



F r (2 



a 



x exp 



hxF 

TO 

\nF/r ) 



Co 



2(2V3) 2 



.(9) 



This is then normalized such that J p(F)dF = 1. 

The parameters in Equation [9] that characterize the 
IGM are a and To- To is a function that depends on 
the background photoionization rate in the IGM and the 
temperature at mean density, but for the purposes of this 
paper we adjust it to yield a fixed value of mean optical 
depth (F)(z) = exp (— T e g(z)) for a given redshift. Thus, 
at fixed redshift the only free parameter in this model 
is 7 = 1 + a/0.7 which parametrizes the temperature 
density equation-of-state. 

Note that this model does not correctly account for 
thermal broadening of the Lya fore s t line s, nor pecu- 
liar velocities. However. iBolton et al.l (|2008| ) have shown 
from hydrodynamical simulations that the flux PDF is 
not sensitive to To, which affects the Lya forest pri- 
marily through thermal broadening and by changing the 
Jean's smoothing scale. Peculiar velocities need to be 
included to obtain the correct flux PDF shape, but they 
arise primarily from gravitational collapse of large-scale 
structure. Thus, we do not expect the omission of pecu- 
liar velocities to seriously affect the relative behavior of 
the flux PDF with changes to the temperature-density 
relation. 

We show the flux PDFs calculated for 2 extreme values, 
7 = 0.5 and 7 = 1.6 in Figure[TJ 7 = 0.5 corresponds to a 
highly inverted te mperature-density relation, which ha d 
been detected by (|Becker et alj2007t IBolton et alf2 008). 
While 7 = 1.6 is a value calculated bv iHui fc Gnedinl 
(|1997f ) for a post-reionization relaxed IGM, which should 
be valid in the epoch between the end of hydrogen reion- 
ization and prior to He II reionization, 3 < z < 6. 

Figure Q] illustrates the effects of changing 7: at fixed 
temperature at mean density To, lowering 7 increases the 
temperature in the underdense (A < 1) regions of the 
IGM. This decreases the hydrogen recombination rate, 
therefore reducing the H I optical depth and increasing 
the transmission in those regions. The differences are ap- 
parent in the F > 0.5 regions of the PDF, where the peak 
is shifted to higher F. The probability of having pixels 
with close to 100% transmission decreases as 7 goes up; 
at 7 = 1.6, the probability drops to zero by F ~ 0.98 (in 
our model) thus leading to Gunn-Peterson absorption at 
the 2% level. In this case, the lack of transmission peaks 
reaching the true continuum level is likely to lead to an 
underestimate of the continuum, potentially biasing the 
estimate of 7. Note that the moderate-absorption re- 
gions of the PDF (0 .2 < F < 0.6) do no t vary much with 
7. This agrees with IBolton et all (|2008| ). who found that 
using only these moderate-absorption pixels significantly 
weakens constraints on 7. 

The effects of finite signal-to-noise are introduced by 
smoothing the flux PDF from Equation [9] with a Gaus- 
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sian kernel. We use a constant smoothing length of 
(7 sm = 1/50 to simulate an average S/N = 50 per pixel, a 
typical value for high-resolution and high-S /N Lya forest 
sp ectra. This is an accept able simplification, as Table 3 
in iMcDonald et al.l (|2000| ) shows that the typical pixel 
noise in real data is roughly constant across the different 
flux bins. Note that the presence of noise scatters some 
pixels to F > 1, changing the shape of the flux PDFs 
(black curves in Figure [TJ . 

We then bin the fl ux P DFs in th e fash ion of 
IMcDonald et al.l (|2000t l and iKim et al.l (|2007| ): the 
'noisy' flux PDF is divided into 21 bins with size AF = 
0.05 in the range < F < 1. The 'noisy' portions of the 
flux PDF with F < and F > 1 are transferred to the 
F = and F = 1 bins, respectively. 

In Figure [21 we show the binned PDFs for different 
temperature-density relations. The overall trend of the 
unbinned PDFs are similar to that in the unbinned case: 
the high-transmission peak of the PDF shifts to larger 
F as 7 is decreased. In addition, the number of pixels 
in the F = 1.0 bin increases with decreasing 7. The 
overall shape of the flux PDFs are in broad agreement 
with other theore tical flux PDFs published in the lit- 
eratur e, inc luding IMcDonald et al.l fj2000f ) . iBolton et al.l 
l|2008fh and lWhite et al.l (|2010ft r~However. note that we 
are unable to repr oduce the exact shapes of the flux 
PDFs as seen in the IKim et all (120071) data, n or fro m hy- 
drodynamical simulations (e.g.. IBolton eTaDHOOl): our 
PDFs tend to peak at lower values of F for a given value 
of 7, while the 7 = 0.5 PDF here does not have the 
sa me shape in th e high-transmission end obtained from 
the IBolton et a l. (2008) hydrodynamical simulations, in 
which the PDF peaks in the F = 1.0 bin. The MHR- 
FGPA model also appears to underpredict the number 
of l ow-transmission pix els {F 0) in comparison with 
the IBolton et al.l (|2008D simulations. 

In the lower panel of Figure ; 2|, we plot the ratio of the 
flux PDFs with respect to that with 7 = 1.2. These ap- 
pear similar to the analogou s plots shown in the lower 
panels of Figures 2 and 4 in IBolton et~aTl (120081 ). The 
main differences are that the flux PDFs in the hydrody- 
namical simulations pivot at F « 0.1, while those in our 
model pivot at a higher value of F « 0.3. Nevertheless, 
the relative behavior of our model PDFs with r espect 
to 7 appear similar to those in the IBolton et al.l (2008) 
hydrodynamical simulations. Since the primary effect of 
systematic continuum errors is to rescale the flux PDF 
along the abscissa, the fact that our toy model correctly 
reproduces the relative changes in the PDF with respect 
to 7 means that it can be used to study continuum errors, 
even if it should not be used to make direct comparisons 
with data. 

3.2. Effect of Continuum Errors 

In this section, we study the effect of systematic con- 
tinuum biases on the value of 7 measured from the mock 
flux PDFs described above, and the corresponding er- 
rors on this estimate. This mock analysis is carried out 
at a fixed redshift of z = 3, an epoch at which Gunn- 
Peterson absorption is usually assumed to be negligi- 
ble, but co uld account for as much as 4-5% of the ob- 
served flux (|Giallongo et al.|[l992t iFaucher-Giguere et al.l 
l2008bl) . All mock PDFs are set to a me an flux of 
(F)(z = 3) = 0.70 (|Meiksin fc White! 12003) . and the 




Fig. 3. — The effect of systematic continuum errors on the Lya 
forest flux PDF. The top panel shows the unbinned flux PDFs 
generated with 7 = 1.2 and S/N = 50, with no continuum error 
(black line), 2% overestimated continuum (dotted line), and 2% 
underestimated continuum(dashed line). The lower panel shows 
the corresponding flux PDPs in AF = 0.05 bins. 2% errors in the 
continuum determination can drastically change the shape of the 
binned flux PDF. 
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Fig. 4. — The effect of changing To (or equivalently (F)) on the 
flux PDF. The top panel shows the unbinned flux PDFs generated 
with 7 = 1.2 and S/N = 50, with to adjusted to match the ef- 
fective mean-flux of the PDFs with the corresponding line-styles 
in Figure [3] The lower panel shows the corresponding flux PDFs 
in AF = 0.05 bins, to changes the flux PDF differently from the 
temperature-density relation or continuum errors. No continuum 
errors have been applied in these PDFs. 



model parameters from Equation [5] are set to z = 3. 

Systematic continuum errors are introduced into the 
flux PDFs by multiplying the flux scales with the factor 
1 + / c prior to binning — positive values of f c denote 
an overestimate with respect to the true continuum, and 
vice versa. The dotted and dashed curves in Figure |3] 
illustrate the effects of these errors on a flux PDF with 
7 = 1.2. Clearly, just 2% systematic errors in the con- 
tinuum estimation can dramatically change the shape of 
the binned flux PDFs: with a 2% underestimate of the 
continuum, the number of unabsorbed pixels (F = 1.0) 
increases by a factor of two. Note that the continuum 
errors also change the measured mean- flux (F). For the 
cases with f c = [—0.02, 0, 0.02], the effective mean-fluxes 
are (F) = [0.723,0.699,0.669], respectively. The scatter 
in these mean-flux values are at the same level as the 
current observational uncertainty of several percent at 
z « 3. 

To investigate if the mean-flux might be degenerate 
with 7 and / c , in Figure [4] we show 7 = 1.2 flux 
PDFs with To adjusted to give (F) values correspond- 
ing to the curves in Figure EH without continuum er- 
rors included. The corresponding values used to gen- 
erate these PDFs are r = [1.003, 1.185, 1.451] for (F) = 
[0.723,0.699,0.669], respectively. We see that a smaller 
To or higher (F) can shift the high-transmission peak 
of the flux PDF to larger F, analogous to lowering 
7. However, the relative ratio of high-absorption/low- 
transmission pixels (F < 0.85), with respect to the low- 
absorption/high-transmission end (F > 0.85) behaves 
differently with changes in To compared to when 7 or 
f c is varied (c.f. Fig. [5] and Fig. [3]). This suggests that 
the degeneracy of To with respect to 7 and f c can be bro- 
ken in an analysis of the full flux PDF, but nevertheless 
To needs to be taken into account. 

To quantify the effects on the estimated temperature- 
density relation, the flux PDFs with different combina- 
tions of [7, f c ] are then compared with mock data PDFs 
with similar noise properties to the lKim et al.l ([2007D flux 
PDFs, which is the best existing data set. However, the 
bins of the flux PDF are highly correlated, thus one needs 
a covariance matrix in order to generate random sam- 
ples with the correct covariances. As our semi-analytic 
model does not actually result in simulated Lya forest 
sightlines, it is problematic to estimate the necessary co- 
variances. Instead, we use the same cov ariance matrices 
described in §4.1 of IBolton et~aTl (|2008f) : the covariance 
matrix (kindly provided by James Bolton) derived from 
their z = 2.94,7 = 0.44 simulations arc used to esti- 
mate the correlation coefficients between different flux 
bins. This is then used in conjunctio n with the errors 
(i.e. diagonal covariance terms) in the IKim etaLl (l2007f) 
data to estimate the cross-terms of the final covariance 
matrix. We then carry out Cholesky decomposition on 
this covariance matrix to generate random correlated er- 
rors, which are then applied to the model flux PDFs. 
This process assum es that all the mode l covariances are 
the same as those in IBolton et al.l (120081 ) . but this should 
be reasonable considering the approximate nature of our 
analysis. 

We can then quantify the effects of continuum bias by 
sampling the reduced Chi-squared between each mock 
data PDF, and different model curves with various com- 
binations of 7 and fc- Since the bins are highly corre- 
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Fig. 5. — Likelihood plots for different combinations of 7 and / c , 
computed with respect to a mock flux PDFs generated with 7 = 
f.0. The contours enclose the 68% and 95% confidence intervals, 
while the color scale denotes A\ 2 = \ 2 ~ X 2 ■ The square and 
cross denote the extreme values of 7 plottecl in Figure \E\ which 
constitute acceptable ~ lcr fits to the 7 true = 1.0 mock flux PDF 
(also plotted in Figure |6j once continuum errors are considered. 
The white horizontal arrow denotes the 68 % confidence interval 
for 7 when the continuum is perfectly known, i.e. f c = 0.. 

lated, the expression for the Chi-squared is: 

X 2 = [dt - PT^fC^ldi - P r dcl ] (10) 

where di are the bins in the mock data PDFs, p™ odcl is 
the model flux PDF with some combination of 7, f c and 
To, while Cij is the covariance matrix discussed above. 
The confidence level of the various parameter combina- 
tions can then be estimated from A\ 2 = x 2 — Xmin- 

In our analysis, we regard the Lya forest mean- 
flux, (F), (or equivalently within our model, To) as 
a nuisance parameter to be marginalized over. At 
each point in the [7, f c ] model parameter space, we 
marginalize over the mean-flux in the likelihood, C = 
[(2n) N det Qj] exp{ — (l/2)x 2 }. If we assume the error 
in the mean-flux is Gaussian, we can then marginalize 
over the possible mean- flux values (F)': 

£(7,/ c |(F))(x / £(7, f c , (F)') 

V a {F) ) 

where we have used (F) = 0.70 and a/p\ = 0.02 for 
z = 3. Note that we have ignored normalization factors 
that will be canceled out when we evaluate A% 2 , and 
have evaluated the integral using 7-point Gauss-Hermite 
quadrature. 

Figure [5] shows the likelihood plot for fits to a mock 
data PDF generated with 7'™° = 1.0, computed with 
respect to different combinations of 7 and f c - The con- 
tours indicates the 68% and 95% confidence intervals 
(Ax 2 = 2.30 and A% 2 = 6.17, respectively). The er- 
ror ellipse shows a significant degeneracy between 7 and 
f c - For reference, we also show the 68% confidence level 
(A% 2 = 1) in the case where the continuum is not al- 
lowed to vary (horizontal white arrows). The range of 7 
which falls within the 68% confidence interval increases 
significantly to a 1 = 0.2 — 0.3 once the continuum level 
is allowed to vary. Without continuum errors, the erroiQ 

3 We use the terms 'lcr errors' and '68% confidence intervals' 
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Fig. 6. — Flux PDFs corresponding to Figure [5] The error bars 
denote the mock 7 = 1.0 flux PDF with simulated scatter, against 
which the likelihoods in Figure[5]were calculated. The dashed- and 
dotted lines denote model PDFs with [7 = 0.75, f c = -0.007] and 
[7 = 1.2, f c = 0.006], which constitute acceptable (Ax 2 ~ 2, 68% 
confidence level) fits to the mock 7 truc = 1.0 data points. Their 
respective positions in [7, f c ] space are shown as the square and 
cross in Fig[5] Note that the ordinate axis is plotted logarithmically 
in this figure. 

in the estimated 7 is 07 » 0.05. 

The square and cross symbols in Figure [5] denote two 
extreme points in the [7, f c ] parameter space that fall 
within the 68% confidence interval of the underlying flux 
PDF with 7 truc = 1.0. The binned flux PDFs for these 
two combinations of [7, f c ] are juxtaposed with the un- 
derlying PDF in Figure |B] The similarity of these PDFs 
with significantly different temperature-density relations 
(7 = 0.75 — 1.2) illustrates the effect of continuum errors 
on the measurement. 

Figure [7] shows the average estimated value of 7 as 
a function of the 'true' underlying temperature-density 
power-law 7 truo and its associated errors, averaged over 
40 Gaussian realizations for each value of 7 truo . The 
dotted lines denote the average 68% confidence inter- 
vals if systematic continuum errors are considered, while 
the shaded area show the la intervals if the contin- 
uum is known perfectly. We see that continuum errors 
cause a significant increase in the error on 7: at higher 
temperature-density slopes (7 ~ 1.5) the errors increase 
from 07 w 0.05 to er^ ont w 0.15; at flatter or inverted 
slopes, the errors are generally larger but still increase 
from 07 w 0.1 to <7™ nt « 0.2. In general, the errors in- 
crease by a factor of ~ 2 once we consider continuum 
errors. 



4. DISCUSSION & CONCLUSION 

In this paper we have quantified the effect of system- 
atic continuum errors on the temperature-density rela- 
tion, 7, estimated from the Lya forest flux PDF, using a 
simple toy model which correctly reproduces the relative 
changes in the flux PDF with respect to 7. 

We found that small systematic errors of just ~ 1 — 2% 
in the overall continuum level can bias the estimated 7 
to smaller values. In the absence of continuum errors, 
the 1<7 errors on the estimated value of 7 is <t 7 ~ 0.1, 
but with continuum errors this interval is increased to 
07 « 0.2. 

interchangeably but the latter is the quantity we have really mea- 
sured. These two terms would be identical in the case of Gaussian 
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Fig. 7. — The la upper- and lower- limits in the value of 7 esti- 
mated from mock flux PDFs generated with some underlying value 
of 7 truc , when continuum biases are taken into account (dotted 
lines) and if the continuum is perfectly known (shaded area en- 
closed by solid lines). Continuum uncertainties roughly double the 
error on 7. These values are averaged over 40 mock data realiza- 
tions for each value of 7 truc . 




Fig. 8. — A Lya forest sightline from the Ccn & Chisari (2010) 
hydrodynamical simulations extracted at z = 3.0 and smoothed 
to a resolution of 6.7kms — 1 . The dashed horizontal line indicates 
the F = 0.95 flux level. Even though no pixel noise is present, 
an attempt to fit the continuum to this mock spectrum will likely 
result in a continuum bias of f c —0.05. Courtesy of R. Cen. 

While we have used a simple semi-analytic model 
to calculate the flux PDF s (although note th at this is 
the same model used by iBecker et al.l ()2007f ) to ana- 
lyze their spectra), the relative scaling of the resultant 
flux PDF with 7 is reasonably accurate and thus the bi- 
ases discussed here are qualitatively valid. We have also 
marginalized over the mean-flux in our analysis. 

In addition, in the simple mock analysis presented here, 
both the data realizations and 'theory' flux PDFs are de- 
rived from exactly the same model. In a real data anal- 
ysis, uncertainties in the underlying physical model (e.g. 
gas temperature, UV ionizing background, erg, Jeans' 
smoothing scale etc.), and other observational uncertain- 
ties such as metal contamination must increase the error 
in 7 beyond those presented here. 

Another point that should be emphasized is that prior 
to the biases in 7 considered in this paper, systematic 
continuum errors are not a symmetric effect. Overes- 
timates of the continuum (/ c > 1) are less likely with 
high-quality data — an observer is unlikely to place the 
continuum level significantly above the observed trans- 
mission peaks, whereas low-level Gunn-Peterson absorp- 
tion can degrade the transmission peaks and lead to 
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underestimates of the continuum (/ c < 0). There is 
thus an additional bias towards smaller values of 7 due 
the higher probability of underestimating the continuum 
rather than overestimating it. In the MHR00 model con- 
sidered here, the maximum amount of possible Gunn- 
Peterson absorption is in fact fairly limited at z = 3: 
about 2% with 7 = 1.6 (Figure [TJ. Other models could 
provide even more Gunn-Peterson absorption at these 
rcdshifts. An example is shown in Figure which plots 
az = 3 Lya forest spectrum extracted fr om the detailed 
hydro dynamical simulations described in ICen fc Chisaril 
(pOlOh . If this were actual data, an observer would prob- 
ably underestimate the spectrum by 5% even in the ab- 
sence of noise. 

Recent studies of the flux PDF that clai med a highly 
inver t ed temperature-d e nsity relation (| Becker et al.l 
120071 iBolton et all 120081 : IViel et al.1 120091 ) have consid- 
ered t he possibility of a sy stematic continuum bias. How- 
ever, I Becker et al.l ()2007[ ) used the same semi-analytic 
MHR00 model for the flux PDF used in this paper, 
which may not accurately capture the details of the Lya 
forest flux field at a su fficiently level for data analysis. 
IBolton et all (|2008l ) and IViel et al.l (l2009fl both analyzed 
the same data set from iKim et al.l (120071 ): IBolton et al.l 



(2008) checked for continuum errors by comparing their 
calculating likelihoo ds after rescaling their continua by 
1.5% and 5%, while IViel etaLI (|200l marginalized the 
continuum in their analysis — both concluded that con- 
tinuum errors were not significant and that the inverted 
temperature-density relatio n was favor e d. Ho wever, it is 
also interesting to note that IViel et all (|2009f ) arrived at 
a best-fit continuum err or suggesting an overestimated 
continuum of 1% in the IKim et a l. (2007) data (c.f. the 
random errors in the continuum fitting, a&t = 1 — 2%). In 
other words, they favor a continuum w hich is lower than 
that which was actually fitted to the IKim eTaTl (120071 ) 
spectra. In the context of our analysis, this would be con- 
sistent wit h an u nderestimation of 7. We also note that 
IViel et aT] (|2009f ) had fitted for a re dshift-independent 
continuum error when analyzing the IKim et all ()2007f ) 
flux PDFs. Since this data set is dominated by lower- 
rcdshift (z < 2.5) spectra (which should be less affected 
by low-level Gunn-Peterson absorption), it is possible 
that the true continuum error cannot be fully accounted 
for, using a redshift-independent approach. 

Considering the controversial nature of the claims of 
a highly-inverted IGM temperature-density relation, we 
feel a more direct approach towards dealing with con- 
tinuum bias is required that had been done previo usly. 
There are some possibilities: iGiallongo et al.l (|1992l ) had 



extrapolated a power-law continuum from redwards of 
the quasar Lya emission line to estimate the amount of 
uniform GP absorption in the peaks of the Lya forest. 
Existing data sets will allow much stronger constraints 
to be placed using this method, although it requires the 
assumption that the mean quasar continuum slope does 
not change with redshift. 

Alternatively, when comparing simulations with data, 
the simulated sightlines need to be processed through 
the same continuum-fitting metho d as the observed data, 
i.e. thr ough 'forward- modeling'. iFaucher-Giguere et al.l 
(2008b) estimated continuum biases by fitting simulated 
mock spectra, and comparing these fits with the underly- 
ing continuum. However, they were attempting to mea- 
sure the optical evolution of the Lya forest, and used sim- 
ulations with a fixed value of 7 = 1.6. For this method 
to account for continuum biases in studies exploring a 
large parameter space, one would need to hand-fit large 
sets of mock spectra (ideally including realistic quasar 
continua) covering the explored parameter space. This 
would be time-consuming, but in principle would be a 
robust method to account for systematic errors in the 
continuum fits, esp ecially if automated co ntinuum-fitting 
methods are used (jDairAglio et al.ll2009l ). 

Another avenue for improvement to use larger Lya for- 
est data sets, and hence redu ce the errors in the flux 
PDF and other statistics. The IKim et al.l (|2007f) sample 
(18 quasars) repres ent a significant increas e in data size 
m comparison with iMcDonald et al.l (|2000f ) (8 quasars), 
but considerably more high-resolution, high-S/N spec- 
tra than these currently exist. Increasing the sample 
size would clearly reduce the errors in the measured flux 
PDF, which would limit the scope of continuum errors 
to bias the measured temperature-density relation. 

At time of writing, the measurement of the inverted 
temperature-density relat ion has been carried out by the 
same group of authors (|Bolton et all 120081: IViel et al ' 
2009) analyzing the same flux PDF data (|Kim et al 
2007). Alternative and independent analyses are ur- 



gently required in order to verify this phenomenon, which 
would have important implications on IGM science as 
well as energetic sources in the high-redshift universe. 
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